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Abstract 

Consider the following stochastic graph process. We begin with Go, the empty graph on 
n vertices, and form Gi by adding a randomly chosen edge to where is chosen 
uniformly at random from the collection of pairs of vertices that neither appear as edges 
in nor form triangles when added as edges to G,_i. Let the random variable M be 
the number of edges in the maximal triangle free graph generated by this process. We 
prove that asymptotically almost surely M = 0(n 3 / 2 \/logn). This resolves a conjecture 
of Spencer. Furthermore, the independence number of Gm is asymptotically almost 
surely 0(\/nlogn), which implies that the Ramsey number i?(3, t) is bounded below by 
a constant times £ 2 /logi (a fact that was previously established by Jeong Han Kim). 
The methods introduced here extend to the 7\4-free process, thereby establishing the 
bound i?(4,i) = Q(t 5 / 2 /\og 2 t). 

1 Introduction 

Consider the following constrained random graph process. We begin with the empty graph 
on n vertices, which we denote Go- At step i we form the graph G% by adding an edge to 
Gi-i chosen uniformly at random from the collection of pairs of vertices that neither appear 
as edges in Gj_i nor form triangles (i.e. copies of K3) when added as edges to Gi-\. The 
process terminates with a maximal triangle- free graph on n vertices, which we denote Gm 
(thus the random variable M is the number of steps in the process). We are interested in 
the likely structural properties of Gm as n tends to infinity; for example, we would like to 
know the value of M and the independence number of Gm. 

The study of this graph processes began by the late 1980's (see Bollobas [6]). The first 
published result on a process which iteratively adds edges chosen uniformly at random from 
the collection of potential edges that maintain some graph property is due to Ruciriski and 
Wormald, who answered a question of Erdos regarding the process in which we maintain a 
bound on the maximum degree [20] . Erdos, Suen and Winkler considered both the triangle- 
free process and the odd-cycle- free process [11]. The H-free process where H is a fixed 
graph was treated by Bollobas and Riordan [7] as well as Osthus and Taraz [18]. While 
these papers establish interesting bounds on the likely number edges in the graph produced 
by the H-iree process for some graphs H, for no graph H that contains a cycle has the 
exact order of magnitude been determined. 
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Another motivation for the triangle-free process comes from Ramsey theory. The Ram- 
sey number R(k, £) is the minimum integer n such that any graph on n vertices contains a 
clique on k vertices or an independent set on I vertices. The Ramsey numbers play a central 
role in combinatorics and are the subject of many notoriously difficult problems, most of 
which remain widely open (see, for example, [12] [8]). One problem regarding the Ramsey 
numbers that has been resolved is the order of magnitude of R(3, t) as t tends to infinity: 
Ajtai, Komlos and Szemeredi proved the upper bound R(3,t) = 0(t 2 / logt) and Kim es- 
tablished the lower bound R(3,t) = n(t 2 /logt). (There were a number of significant steps 
over the course of about 30 years that led up to these final results; see [9], [10], [13], [22], 
[21], [11], [17], [23].) The problem of determining the asymptotic behavior of R(3,t) was 
one of the motivations for the introduction of the triangle-free process. Indeed, we establish 
a lower bound of the form R(3, t) > n by proving the existence of a graph on n vertices 
with neither a triangle nor an independent set on t vertices, and the triangle-free process 
should produce such a graph as it should include enough 'random' edges to eliminate all 
large independent sets. In a certain sense, Kim's celebrated result verified this intuition 
as he used a semi-random variation on the triangle- free process. (This was an application 
of the powerful Rodl nibble that was inspired by an approach to the triangle-free process 
proposed by Spencer [23].) However, the problem of whether or not the triangle- free process 
itself is likely to produce a Ramsey i?(3, t) graph remained open. 

Our main results (Theorems 4 and 5 below) have the following Corollaries. 

Theorem 1. Let the random variable M be the number of edges in the graph on n vertices 
formed by the triangle-free process. There are constants c±,C2 such that asymptotically 
almost surely we have 

ci y/\og n ■ n 3 / 2 < M < c 2 y / logn • ra 3//2 . 

Theorem 2. There is a constant C3 such that the following holds: If n = n(t) < C3 -t 2 /logt 
then a.a.s. the triangle-free process on n vertices produces a graph with no independent set 
of cardinality t. Thus R(3,t) > C3 • for t sufficiently large. 

Theorem 1 proves a conjecture of Spencer [23]. Theorem 2, which establishes that the 
triangle-free process is an effective randomized algorithm for producing a Ramsey R(3, t) 
graph, is a direct consequence of Theorem 5 below. These results reveal a visionary aspect 
of the 1961 paper of Erdos [9] which established the bound R(3, t) = f2(t 2 / log 2 t). When the 
probabilistic method was in its infancy, Erdos established his bound by analyzing a greedy 
algorithm applied to a random graph, and it turns out that a random greedy algorithm 
produces a Ramsey -R(3, t) graph. For an explicit construction of a triangle-free graph on 
0(£ 3 / 2 ) vertices with independence number t see Alon [2]. 

The methods introduced here can be applied to other processes. In fact, our methods 
immediately suggest an approach to the H-free process for general H. This applies to 
hypergraph processes as well. As an example, we analyze the i^-free process to prove the 
following result: 

Theorem 3. There is a constant C4 such that for t sufficiently large we have 

jS/2 

R{A,t) > c 4 • —5-. 

log t 
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This is a minor improvement on the previously best known lower bound, ((£/ log i) 5//2 ) , 
which was established by Spencer via an application of the Lovasz Local Lemma [22]. 

We analyze the triangle-free process by an application of the so-called differential equa- 
tions method for random graph processes (see Wormald [25] for an introduction to the 
method). The main idea is to identify a collection of random variables whose one-step 
expected changes can be written in terms of the random variables in the collection. These 
expressions yield an autonomous system of ordinary differential equations, and we prove 
that the random variables in our collection (appropriately scaled) are tightly concentrated 
around the trajectory given by the solution of the ode. Recent applications of this method 
include results that link the emergence of a giant component in a random graph process to 
a blow-up point in an associated ode [5], [24], [3] and an analysis of a randomized matching 
algorithm that hinges on the existence of an invariant set in an associated ode [4] . 

We track the following random variables through the evolution of the triangle-free pro- 
cess. Recall that Gi is the graph given by the first i edges selected by the process. The 
graph Gi partitions (y) into three parts: Ei,Oi and Cj. The set Ei is simply the edge 
set of Gi. A pair {u,v} G (^') is open, and in the set Oj, if it can still be added as an 
edge without violating the triangle-free condition. A pair {u,v} G (^) is closed, and in 
the set Ci, if it is neither an edge in the graph nor open; that is, the pair {u,v} is in Ci if 
there some vertex w such that {u, w}, {v, w} G Ei. Note that e^+i is chosen uniformly at 
random from O;. Set Q(i) = |0»|; this is one of the random variables we track. For each 
pair {u,v} G (^) we track three random variables. Let X U)V (i) be the set of vertices w 
such that {u, w}, {v, w} G Oj. Let Y UjV (i) be the set of vertices w such that 

\{{u,w},{v,w}} nOi\ = \{{u,w},{v,w}} HEi\ = 1. 

Finally, let Z u ^ v {%) be the set of vertices w such that {u,w},{v,w} G Ei. Note that if 
Z u ,v(i) 7^ O then we have {u, v} G Cj. We dub vertices in X U)V open with respect to {u, v}, 
vertices in Y u<v partial with respect to {u, v} and vertices in Z UtV complete with respect 
to {u, v}. We track the variables |X njl) (i)|, |l^ )t; (i)| and |Z Ui ^(i)| for all pairs u,v such that 
{u, v} G" Ei. (In fact, we only show that |Z UiU (z)| does not get too large; so we track this 
random variable in the sense that we bound it). We emphasize that we make no claims 
regarding the number of open, partial and complete vertices with respect to pairs {u, v} 
that are edges in the graph. Formally, we set X UjV (i) = X u>v (i — 1), Y u>v (i) = Y u ^ v (i — 1) 
and Z U)V (i) = Z u>v (i - 1) if {u,v} G Ei. 

In order to motivate our main results (Theorems 4 and 5 below) we present a heuristic 
derivation of the trajectory that the random variables Q(i),\X UtV (i)\ and [^^(i)) should 
follow. We stress that this discussion does not constitute a proof that the random variables 
follow this trajectory; the proof itself comes in Section 3 below. We begin by choosing 
appropriate scaling. We introduce a continuous variable t and relate this to the steps Gi 
in the process by setting t = t(i) = ijrc'l' 1 . Our trajectories are given by three functions: 
q(t),x(t) and y(t). We suppose Q(i) is approximately q(t)n 2 , \X UiV (i)\ is approximately 
x(t)n for all {u, v} G (^') \E% and |l^ )t ;(i)| is approximately y(t)y/n for all {u, v} G (^) \Ei. 
Consider a fixed step % in the graph process and let e > be sufficiently small. We suspect 
that the changes in our tracked random variables are very close to their expected values over 
the ensuing en 3//2 steps of the process and use this guess to derive our system of differential 



3 



equations. We begin with |Oj|. Note that if e^+i = {u,v} then there is exactly one edge 
closed for each vertex that is partial with respect to {u,v}; in other words, if e^+i = {u, v} 
then Q(i + 1) = Q(i) — 1 — |y u ^(i)|. Therefore, we should have 



q(t + e)n 2 « Q [i + en 



.3/2 



Q(i) 



en 



3 l 2 .y{t)n l l 2 ^(q(t)-ey{t))n 2 . 



This suggests dq/dt = —y. Now consider the variable Consider a fixed vertex 

w that is open with respect to {u,v}. Note that the probability that the edge e^+i closes 
{u, w} (i.e. the probability of the event {u, w} G Ci+i) is |^w,iu|/|Oj|. As the probability 
that ej + i G {{«, w}, {v, w}} is comparatively negligible, we suspect that we have 



x{t + e)n 



x(t)- 



i + en 



3/2 



en 



3/2 _ 



x(t)n 



q(t)n 2 



9(0 



This suggests dx/dt = —2xy/q. Finally, we consider ^^(i)!. First note that a vertex that 
is partial with respect to {u, v} has its one open edge closed by e^+i with probability nearly 
y(t)^/n/ (q(t)n 2 ). The probability that a vertex that is open with respect to {u, v} becomes 
partial with respect to {u, v} is 2/Q(i). So, we should have 



y{t + e)^/n; 



i + en 



3/2 



which suggests dy/dt = —y 2 /q + 2x/q. As |Oo| = n(n — l)/2, |A" n ^(0)| = n — 2 for all pairs 
{u,v} and 11^(0)1 = for all pairs {u, v}, our expected value computations suggest that 
our random variables should follow the trajectory given by 



dq 

Tt=~ y 



dx 
~dt 



2xy_ 
Q 



dy 
dt 



y 2 2x 
-— + — 

q q 



(i) 



with initial conditions q(0) = 1/2, x(0) = 1 and y(0) = 0. The solution to this autonomous 
system is 



-4t* 



x(t) 



-8t 2 



y(t) = Ate 



-4P 



(2) 



Note that if \Oi\ indeed follows q(t)n 2 then the triangle-free process will come to end at 
t = ©(vdog n) ; that is, the process will end with Q(-\/\ogn ■ n 3 ^ 2 ) edges. (It was Peter 
Keevash who pointed out that (1) has this tantalizing solution [15].) Observe that the 
functions q(t),x(t),y(t) are the appropriate values for G chosen uniformly at random from 
the collection of graphs with n vertices and in 3 / 2 edges. 
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We introduce absolute constants p, j3, 7 and p. The constants p and p are small, (3 is a 
large relative to and 7 is large relative to both p and j3. (These constant can take values 
p = p = 1/32, (3 = 1/2 and 7 = 161. No effort is made to optimize the constants, and we 
do not introduce the actual values in an attempt to make the paper easier to read). Set 

m = /xydog n • n 3 / 2 . 

Our first result is that our random variables indeed follow the trajectory (2) up to m random 
edges. In order to state this concentration result we introduce error functions that slowly 
deteriorate as the process evolves (in the language of Wormald [25] we employ 'the wholistic 
approach' to the differential equations method). Define 

{ 41t 2 +40t if / < 1 
% 2+40t . f " /,(*) = e 37t2 + 40 < f y (t) = (3) 

and set 

g q (t) = f q (t>~ 1/6 9x{t) = f x (t)n- l/& 9y (t) = f y (t)n- l '\ 

Let Bj be the event that there exists there exists a step i < j such that 

|Q(i) - q(t)n 2 \ >g q (t)n 2 

or there exists some pair {u, v} G (v) \ sucri that 

\\X U)V (i)\ - x{t)n\ > g x {t)n or | \Y UyV (i)\ - y(t)y/n\ > g y (t)y/n or \Z UjV (i)\ > log 2 n. 
Theorem 4. If n is sufficiently large then 

Pr (Vi^3/ 2 ) <e-^ n . 

Note that Theorem 4 alone places no upper bound on the number M of edges in the graph 
produced by the triangle-free process. In order to achieve such a bound, we bound the 
independence number of G m . 

Theorem 5. If n is sufficiently large then 

Pr (« (Gwi^-n 3 / 2 ) > 7V 7 ™ log n I B^j < e~ nl/ \ 

Since the neighborhood of each vertex in the triangle-free process is an independent set, it 
follows immediately from Theorem 5 that the maximum degree in Gm is at most 7V n log n 
a.a.s. Thus, we have proved Theorem 1. 

The remainder of the paper is organized as follows. In the next section we establish some 
technical preliminaries. Theorems 4 and 5 are then proved in Sections 3 and 4, respectively. 
The proof of Theorem 3 is given in Section 5. 
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2 Preliminaries 



Our probability space is the space denned naturally by the triangle- free process. Let f2 = Q n 
be the set of all maximal sequences in (^) with distinct entries and the property that each 
initial sequence gives a triangle- free graph on vertex set [n]. We stress that our measure is 
not uniform: it is the measure given by the uniform random choice at each step. We always 
work with the natural filtration Tq C T\ C • • • given by the process. Two elements x, y of 
Q are in the same part of the partition that generates J-j iff the first j entries of x and y 
agree. We use the symbol Uj to denote one of the parts in this partition (i.e. Uj denotes 
a particular history of the process through j steps); in particular, if uj G f2 then ujj is the 
part of the partition that defines J-j that contains u. 

For the purpose of notational convenience we use the symbol '±' in two ways: in interval 
arithmetic and to define pairs of random variables. The distinction between the two should 
be clear from context. The degree of a vertex v in Gi is denoted di(v) and the neighborhood 
of v in G{ is Ni(v). 

Our main tool for establishing concentration is the following version of the Azuma- 
Hoeffding inequality. Let r/,N > be constants. We say that a sequence of random 
variables Aq, A±, . . . is (77, iV)-bounded if 

Ai - 77 < A i+ i < Ai + N for all i. 

Lemma 6. Suppose 77 < N/2 and a < rjm. If = Aq,A\,... is an (ij, N) -bounded sub- 
martingale then 

a? 

Pr[A m < -a] < e 3 " mJv . 

Lemma 7. Suppose 77 < N/10 and a < mq. If = Aq,A\,... is an (77, N) -bounded 
supermartingale then 

a? 

Pr[A m > a] < e 3 " mAr . 

As the author failed to find a reference for these particular inequalities in the literature, 
proofs are given at the end of the paper, in Section 6. We often work with pairs A^ , A 1 , . . . 
where Aq ,Af , ... is an (77, iV)-bounded submartingale and Aq , A^ , ... is an (77, A r )-bounded 
supermartingale. We will refer to such a pair of sequences of random variables as an (77, N)- 
bounded martingale pair. 

3 Trajectory 

Here we prove Theorem 4, which establishes tight concentration of the random variables 
\0%\> \X U:V (i)\ and around the trajectory given in (2) and bounds |Z Ui „(i)|. 

Recall t = t(i) = i/n^l 2 and m = /j-^/log n ■ re 3 / 2 and 

{ 41i 2 +40t„-l/6 if y- < 1 
e41t2+40t 1/6 ' - 1 9 x (t) = e™ 2 +^n-^ {t) = e 4^ + 40t n -i/6. 

e _ n -l/b if t > 1 

Note that 

g q < j and g x = g y . (4) 
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We define events X, y and Z. For uj £ B m let £ be the smallest index such that uj £ Be 
but uj Bi-i] in other words, the random variable I is the first time that one of our tracked 
random variables is outside the allowable range. We define X to be the set of uj 6 B m such 
that there exists a pair {u, v} such that {u, v} Eg and 



So, an atom uj S B m is in X if there is some pair of vertices {u, v} such that the number of 
open vertices with respect to {u, v} is a reason we place uj S Bg. Define y and Z analogously. 
We prove Theorem 4 by showing 



and then bounding the probabilities of X, y and Z. In the next subsection we show that if 
i s m ran g e f° r all j < i and all pairs {u, v} then \0{\ is in range, thereby establishing 
(5). In the following three subsections we establish upper bounds on the probabilities of 
the events X, y and Z, respectively. 

3.1 Open edges 

Here we simply take advantage of the strict control we enforce on the number of partial 
vertices at each pair; we do not invoke any concentration inequalities in this subsection. 
Note that if we have e^+i = {u, v} then the number of edges closed when we add ej+i 
is simply equal to ^^(i)!, the number of partial vertices at {u, v}. Therefore, assuming 
uj Bj-i, we have 



X u , v (£)\ g n[x(i(*))± 



xuyuz 



(5) 




i=0 




2 



_ "i-i 

j~Vn ^Ty(t)±g y (t) 



li=0 





o Jo 



Cn 2 [q(t(j)) ± 9Mj))} ■ 



Note that this establishes (5). 



3.2 Open vertices 

Consider a fixed {u, v} G (^)- We write 



Xu,vU)\ 



= n — 



i=l 
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where A{ is the number of open vertices at {u, v} that are eliminated when ej is added to 
the process. Define Af and A~ by 



A 



± 



± (lit + 39)g x (t) 



if uj G" and {it, i2j 
if a; € Bi^i or {u, u } G Ei 



i=i 



Note that if uj G" Z3j_i and {u, -Ej then we have 

~2x(t)y(t) 



\X l , v (j)\=n-2-^2Af-Y J 



<n- Bj 



i=l 



n 



fw) ir+ »v«r (17r+3!)t 

Jo q( t ) Jo 



{17t + 39)g x (t) 
t(j) 



1 



n 

37r 2 +40r 



o q( t ) 

<nx{t{j))+n^ ( e 37 ^ +40t ^ 



dT 



b: 



n[x(t(j))+g x (t(j))}-(B++n 5 / 6 y 



Therefore, the event \X UtV (j)\ > n [x(t(j)) + g x (t(j))] is contained in the event B^ < — n 5 / 6 . 
Similarly, the event < n[x(t(j)) — g x (t(j))] is contained in the event BJ > re 5 / 6 . 

We bound the probabilities of these events by application of the martingale inequalities. 

Claim 8. Bf,Bf,... is a -bounded martingale pair. 

Proof. We begin with the martingale condition. Of course, we can restrict our attention 
to uJi such that uJi % Bi and {u,v} G" Ei. Consider a vertex w G X uv (i). Note that 
w X u<v (i + 1) if e^+i G {{u, w}, {v, w}}, e^+i connects {u, w} to one of the vertices that 
is partial at {u, w} or ej+i connects {v, w} to one of the vertices that is partial at {v, w}. 
Note that (as we assume {u, v} G" Ei) the edge ei+i plays 2 of these roles if and only if 
e,; + i = {z,w} where z G Z U)V (i). It follows that we have 



Pr (w<£X u>v (i + l)) 



2 + \Y UtW (i)\ + \Y VtW (i)\ - \Z UtV (i)\ 

\o,\ 



and therefore 



E[A i+1 | Ti\ 



\Oi\ 



2 + \Y UjW (i)\ + \Y v 



t»eXu,»(i) 
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As we restrict our attention to cjj % Bi, we have 

n{x + g x )\og 2 n 2n(x + g x 



E[A 



t+1 



2n i l 2 (x±g x ){y±g y ) 

J~ i t n / \ ~T" 



?? 



2 (<z±s 9 ) 



! (<? - 9q) ' ™ 2 (<? - ffg) 



c 



c 



c 



1 



Ixy ( 2g y x + 2g x y + 2g x g y 



q-g q 



+ 



2xyg q 
q(q - 9q] 



+ 



log 2 n 



n 



Ax 



2xy 



2xy 



5e- 4t2 g y + 17tg x + 5^e 4f2 + 33te~ 4t2 g q 



1 4x\ 
n q ) 

At 2 
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±{YItg x + mg x 



(Note that we apply (4).) This establishes the martingale condition. 

Now we turn to the bounds on Af +1 . We use the simple fact that the set of edges closed 
when we add is determined by Y ei+1 (i); one edge in each partial triangle in Y ei+1 (i) is 
closed. Therefore, the maximum value of A4 is bounded above by (y(t) + g y (t))y/n, which 
is at most y/n. Of course Af takes its smallest value when Ai = 0, and in this case we have 
Af > -4/ y/n as 2xy/q < 4/y/e. □ 



Applying Lemmas 6 and 7 we have 



Pr{B+ < -n 5 / 6 ),Pr(B- > n 5 / 6 ) < e"% 



5/3 



(6) 



We claim that X is contained in the union, taken over all pairs {u, v}, of the events given 



in (6). Indeed, if u G X on account of the the pair {u, v} at step £ then either Bt < 



-11 



5/6 



or Bg > n 5//6 and we also have BJ = 
in the event Bj). Therefore, we have 



BJ and BJ = BJ for all j > i (as we set Af +l = 



Pr(X) < 2 



n 



e 12m 



3.3 Partial vertices 

We use the same reasoning as in the last subsection, but here we break the step by step 
changes in ^^^(z)! into two parts. We write ll^C/)! as a sum 



\YuAj)\ 



1=1 



where Ui is the number of partial vertices at {u, v} created when ej is added and Vi is the 
number of partial vertices at {u, v} eliminated when ej is added. Note that if {u, v} € Ei 
then we set Ui = Vi = (in order to maintain consistency with the definition of Y u>v ). 
We begin with an analysis of Vi. Define = and 




^ ± (82* + l)g y (t)\ i if uj Bi-i and {«, v} $ Ei 

if uj G Bi-i or {u, v} £ Ei 

j 



En 

i=l 



± 
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Claim 9. W^.Wf is a (-,log n) -bounded martingale pair. 



Proof. We begin with the martingale conditions. Suppose w is partial with respect to {u, v}. 
Let w* be the unique vertex in {it, v} such that {w* , w} G Oi. Note that w is removed from 
X UjV if either e^+i = {w, w*} or e$+i is one of the pairs in Oi that links {it;*, w} to Y w * >w (i) 
(other than {it, v} itself). Therefore, restricting our attention to u>i % Bi, we have 



E [V i+1 | Ti\ 



\Y * 

I W l 

la 



As we restrict our attention to cjj ^ and {it, v} G" -Ej we have 

y/n(y ± g y ) (y/n(y ± g y )) 



E [V i+ x | Ti\ G 



1 

C - 

n 

1 

C - 

n 



n 2 (q±g q ) 



q- g q 



+ 



y 2 g q 



q{q - 9 q ) 



— ± (Yltg y + g y + 65^) 



and the martingale conditions are established. 

It remains to establish boundedness. Note that if e^+i does not intersect {it, v} then the 
change in |YJt jt ,(i)| is at most 2 (as all edges that are closed when we add ef+i intersect ej+i). 
So, suppose ej+i = {it, z} where z ^ v. If the vertex to is then removed from Y u>v then the 
edge {w, it} must have been closed by ej+i. This implies {w, z] G Furthermore, as it; 
is partial with respect to {u,v}, we have {u>,i>} G Ei. Thus it; G Z z ^ v {i). Therefore, the 
change in V{ is bounded by the maximum value of \Z Xjy (i)\, which is bounded by log 2 n. 
The lower bound follows from y 2 /q < 8/e. 

Applying Lemmas 6 and 7 we have 

n 2/3 

Pr(W+ < -n 1/3 /2),Pr(W~ > n 1/3 /2) < e 4 8™^"/». 
and 



□ 



Now we turn to E7j. Define 




if a; G" and {it, v} G" -Bj 
if lu G Bj-i or {it, u} G i% 



± 



Claim 10. T^lf 



i=l 



is a (— ,1) -bounded martingale pair. 



Proof. We begin with the martingale conditions. As usual we restrict our attention to 
Ui % Bi. We have 

, i 2\X u Ji)\ 
E[U i+1 \F i }= 1 gp , 
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and 



E [U i+1 | Ti\ e 
c 



2n(x ± g x ) 
1 



n 



2x 2g g x + 2qg, x 



1 

C - 

n 

1 

c - 

n 



q ' q{q-g q ) 



- ± u 9y 
q 



□ 



which establishes the martingale conditions. 

As the addition of ej+i to the graph can create at most one new partial vertex at {u, v}, 
Ui is either 1 or 0. Furthermore, 2x/q = 4e _4t < 4. These two observations establish the 
boundedness condition. 

Applying Lemmas 6 and 7 we have 

Pr(T+ < -n 1/3 /2),Pr(T~ > n 1/3 /2) < 



i/3 



Now we are ready to return to the random variable itself. We have 

3 

\Y u , v (j)\=^2Ui-Vi 



j 



J l 
4^ n 



i=l 



2x 



U 9y 



+ (82t + 



^-"7 + ;E(t-?)-;E1« + % 

Z = l 2=1 

Jo 



q q 

>V^[y(t(j))-9y{t(j))]+n iri + T, 



l<3) , / 

y (82r + 15)e 41r +40r dr + (T+ - 



J "3 

Therefore, the event |l^ iJ; (j)| < y/n[y(t(j)) — g y (t(j))] is contained in 

We have already bounded the probabilities of these events. The analogous argument holds 
for the event |l^ iJ; (j)| > y/n(y(t(j)) + g y (t(j))), with and W~ replaced with TJ and 
Wj~ , respectively. As the random variables T i and Wf~ are 'frozen' once one of the random 
variables leaves the allowable range, we have 



Pr Q?) < 



fn\ ( 2 „i/6, 

/ J . 2 f e 4s™i°g 2 «/« + e «^7^ j < 2n e ' 



11 



3.4 Complete vertices 



Note that the probability that a+i adds a complete vertex at {u, v} is at most |Y U! „(i)|/|Oj|. 
So, in the event Bi, we have 

Pr (M + 1)| _ \Z„(i)\ + 1) < < 

n 2 {q{t) - g q (t)) n^ 2 

Therefore, 

Pr [\Z u , v (m)\ > log 2 n] < (^ff^) (^^f^ < e^******* 
for n sufficiently large. Thus 

Pr(Z) < ( U ) e ^|( lo g 2n ) lo g lo s n . 



4 Independent Sets 

Our goal is now to prove Theorem 5. We will bound from above the probability, conditional 
on B m , that any fixed set K of 7 V n log n vertices is independent. This bound will be so 
small that it remains small when multiplied by the number of such K. The conditioning 
on B m tells us that the variables Q, X UiV ,Y U)V all remain quite close to q(t)n 2 , x(t)n, y(t)y/n 
throughout the process. As it happens, the strength of the error terms g q ,g y ,g x does 
not play a major role in the calculations below. The reader might, at first reading, set 
g q = g y = g x = so as to get a less cluttered view of the techniques involved. 

Recall that /x, j3, 7 and p are constants where \jl and p are small, (3 is large relative to 
p and 7 is large relative to p and (3. Also recall m = p\/Togn ■ r?l 2 . We make 2 initial 
observations (Claims 11 and 12). Let T>i be the event that Gi has a vertex of degree greater 
than [3\Jn logn. 

Claim 11. If n is sufficiently large then 

Pr(V m A^) < e~ nl/5 . 

Proof. We begin by establishing an upper bound on the number of open pairs at each vertex. 
For each vertex v let W v (i) be the set of pairs in Oi that contain v. Let Ai be the number 
of open pairs that contain v that are removed from W v when the edge ej is added to the 
process. Note that we have 



E[A i+1 \Fi}= Yl 

w&W v (i) 



l + \Y v>w (i) 
1 0,:l 



Define 




( 8te~ At - 20g y ) if W v (i) > e~ At n and w B t 



if W v {i) < e~ 4 ' norweB. 
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Note that (restricting our attention to uji % B{ and W v (i) > e u 'n) 



E [B l+1 | H > e^n ( ^f^-^ ) ~ 4= " 

> ^ - - 4= f 8te " 4t2 " 20 ^ 

V<? Q + 9 q q{q + g q )J V nK 

> -= (-3g y - 17tg q + 20g y ) 

> o. 

Therefore, any sequence of the form Bi, Bi+B^+i, . . . , Yl\=i Sj, ... is a (2/ y/n, y / n)-bounded 
submartingale. Therefore, for any I < j we have 



Pr ^B 4 < -n 7/8 < e - ^ 



7/4 
6m 



Now consider the event |W v (j)| > e 4t (-?') 2 n+2n 7//§ . In this event there exists a maximum 
£ < j such that \W V (£)\ < e" 4 *W n. We have 

^2 A< (V 4 *W 2 - e- 4 *^) 2 ) n - 2n 7 / 8 , 

which implies 

£ B, < (e- 4 *W 2 - e" 4i ^) 2 ) n - 2n 7 / 8 * £ ( 8te " 4 * 2 " 20 ^) 
i=e+2 * i=e+i 

3 /'^^ 
< --n 7/8 + n / 20g y (T)dr 
2 JtU+l) 



< -n 



7/8 



Let PL be the event that there exists a vertex v and a step j < m such that |W v (j)| > 
e _4 *u) n _|_ 2n 7 / 8 . We have shown 



Pr ip' m A £ m ) < n(™j exp {-n 7/4 /(6m)} . 



So, we can restrict our attention to the event Note that here we have |W v (j)| < 
A\Oj\/n for all j,v. Now we simply use the union bound. 



Pr (V m A B m A V' m ) < n 



\ P\/n logn / \n J 



( uJlogn ■ n 3 / 2 • 4e \ / uAe 

— n ^=^= = n — — 

\ (3 V n log n-n I \ p 



□ 
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Next we consider the number of open pairs in sufficiently large bipartite subgraphs. Let 
A, B be disjoint subsets of [n] such that 



\A\ 



\B\ 



7-/3 



■\/ n log n = k. 



We track the evolution of the number of pairs in Oi that intersect both A and B. Note that a 
vertex with large degree in either A and B can cause a large one step change in this variable. 
To deal with this possibility, we introduce the following definition. Let A x B be the set of 
pairs {u,v} G (^) that intersect both A and B. We say that the pair {u,v} G A x B is 
closed with respect to A, B if there exists x^iUB and j < i such that 

k 



\N j (x)nA\,\N j (x)nB\ < 



nP 



and 



u, v G Nj(x). 



A pair {u, G A x i? is open with respect to A, B if {u, v} G" and {u, v} is not closed 
with respect to A,B. Define 

W-A.B W = f } G ^4 x i? : {u, v} is open with respect to A, B in Gj} . 

Note that a pair {u, v} £ A x B can be closed (i.e. in d) and still be in WA,B(i)- We 
stop tracking Wa,b as soon as a single edge falls in iUfi; formally, if Ei n (^2^) / $ or 
cjj C Pj V Si then we set Wa,b(z) = Wa,b(i — 1)- 

Let be the event there exist A,B£ (v) and a step i < j such that 

2 

Claim 12. 7/n is sufficiently large then 

Pr (P ro A^) <e" nl/2 . 
Proof. Let be the number of pairs that leave Wa,b at step z of the process. We have 



and | Wab (*)l < e " 4 * 2&2 " 2n x -"/ 3 



E 



Note that we only have an upper bound here as there may be edges between {u, v} and Y UjV 
that would close {u, v} without removing {u,v} from Wa,b- Define 



Yi- 



Note that 







logn ( {~1-P) 2 



8te 



-At* 



+ 20g y 



if \W A ,B{i)\ < e~ it2 k 2 and u & Bt 
if \W A , B {i)\ > e" 4 * 2 A; 2 or u G tfj 



E[Y i+ i \ Si\ <e k 1 



< 

< 
< 0. 



n 2 (q- g q ) 



8te" 4 ' 2 + 20a 



4 / V n 
(7-/3) 2 \ logn 



0V 



+ 



5 9 y 



- + 

q q-g g q(q- g q ) 



8te 



-4t z 



+ 20g y 



{3g y + Y!tg q - 20g y ) 
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Therefore, Xt, Yi + Yg + i, . . . , YH=e 5^, • • • is a ( 27 ^ - , /era p )-bounded supermartingale. It 
follows that we have 



Pr 



> ra 1 "^ 3 



n 2-2p/3 n l/2+p/3 

< exp < -j- > = exp < — — 5— > . (7) 

37 d m log 137 n ■ n~P I 3/i7 d log n I 



Now we turn to the event Vj ABj, where we assume that it is step j where |Wa,b(j)| is 
too small for the first time. There exists a maximum £ < j such that \W A)B (t)\ > e~ 4t ^ 2 k 2 . 
Then 



\W A MJ)\ > ^ m e - X i 

i=i+l 

3 /(„ a\2\ 3 



i=l+2 V \ / i= i +2 

> e ~ 4 *(i)V _ y Yi -k 2 20g y (T)dT - y^. 



Therefore, applying (7), we have 



2 / „,„ , N 2 n V2+p/3 



Pr (V m A B m ) < Q • (n 3 / 2 7ioi^) 2 • exp 



log 2 n 



□ 



Consider a fixed set X of ^y/n logn vertices. We bound the probability that -fT is 
independent in G m by first showing that if K is independent in Gi (and we are not in the 
'bad' event Bi V T>i V "Pi) then the number of pairs in (^f) n Oi is at least a constant time 
e~ it2 \K\ 2 . This implies that the edge e^+i has a reasonably good chance of falling in K. 

We restrict our attention to B m A V m A P™. For each step i of the process such that 
(^) n E'j = let Li be the set of vertices x such that x £ K and |A^(x) fl > k/n p . Set 

ATj = {iVi(x) n K : x £ Li} . 

We first note that, since co-degrees are bounded when we are not in the event Bi, we have 

X,YeMi |XnF|<log 2 n. 

It follows that the cardinality of the union of / sets in A/i is at least fk/n p — f 2 log 2 n, and 
therefore 

| Li | < 2n p . 

Furthermore, as we restrict our attention to T>i , we have 



X £ Mi => \X\< P^nlogn. 
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Now, we identify disjoint sets A, B such that the set of pairs A x B is essentially disjoint 
from ('2) for all X 6 Mi. Form A C K such that \A\ = k by iteratively adding sets from Mi 
for as long as possible. Let B O K \ A have the property that \B\ = k and B n X = for 
all X E A/i that are used to form A Note that we have 

XeMi => I A" nA| < log 2 n • 2n p or |XnB|=0. 

Note that the number of edges in WA,B(i) that are in C% is at most 

\Li\ (2 log 2 n-n p ) ■ (3^n log n < 4(3 log 5/2 n • n 1/2+2p . 

Therefore, since uJi%Vi, 



> e 



> e 



-4^ 



-4^ 



(7 ~ P? 

4 

(7 - 



J n log n - 2n 1 " p/3 - 4/3 log 5/2 n ■ n l/2+2p 



■ n log n. 



Thus, since u>i <2 &i, 



Pr [ e i+ i £ 



> 



(7-/3) 2 logn 
6n 



and the probability that K remains independent is at most 



1 



(7-/3)^logn\ pv 



6ri 



< 



cxp 



(7 - l) 1 

6 



-/' 



log 3 / 2 



n • \/n 



On the other hand, the number of 7-y/n log re-element sets of vertices is 



11 



< 



lie 



y'j^n log re J \7 V / nlogn / 
Theorem 5 now follows from the union bound. 



< exp \ — log 3 ^ 2 re ■ v^j • 



5 The K^-free Process 

We prove Theorem 3 by analyzing the i^-free process on re vertices, showing that it produces 
a graph with independence number 0(log 4//5 n ■ n 2 / 5 ). 

As in the analysis of the i^-free process, we let Ei be the set of edges chosen through 
the first i steps in the process, Cj C (^) be the set of forbidden pairs in G\ and Oi C 
be the set of available pairs in Gj. 

We track the following random variables through the evolution of the i^-free process. 
Let Q(i) be |Oj|, the number of open pairs in fl-fy after i steps of the process. For A e (j 1 ^) 
and / e {0, 1, 2, 3, 4} let X A j(i) be the collection of sets B £ (M) such that 



/ and Cj n 



2 



C 
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Furthermore, for / £ {0, 1,2,3} and A G ('g') let Y A j(i) be the set of vertices v such that 
\E i n(Ax{v})\=f and Q PI S> ) C ' 



2 y " V 2 

Of course, the random variables Xaj are the variables we are most interested in tracking; 
the variables Yaj are introduced in order to maintain bounds on the one-step changes in 
the variables that comprise X A .f- We stop tracking the variables once ( 2 ) C Ei, formally 
setting Xaj(i) = X A j{i — 1) and Ya,/W = ^A,f{i — 1) i n this situation. Our scaling is 
given by t = t(i) = i/n s ^ 5 . 

We introduce functions q(t), Xf(t) for / = 0,1,2,3,4, and y/(t) for / = 0,1,2. Our 
guess for the purpose of setting up the differential equations is the following 

Q(i) « q(t)n 2 \X AJ (i)\ « x f (t)n 2 - 2 -f \Y AJ (i)\ « y f {t)n 1 -^. 

This leads to the system of differential equations 

dq _ dx Q _ 5x x 4 dx f _ (6 - f)xf-i (5 - f)x f x 4 , , _ -, q „ 

at dt q dt q q 

with initial condition g(0) = 1/2, a?o(0) = 1/2 and xi(0) = ••• = 2:4(0) = 0. This has 
solution 



x f (t) = 2f- 1 (%fe- 16 ^ t& for / = 0,1,2,3,4 



With this solution in hand, we turn to y/(t). Here we have the equations 
dyo _ Syoxi dy f _ (4 - /)y/_i (3 - f)yfx 4 



dt q dt q q 

with initial condition yo(0) = 1, yi(0) = and 2/2(0) = 0. This has solution 



for / = 1,2 



/ p -16(3-/)i 5 



y f (t) = V [ f We 

Note that this suggests that the i^-free process terminates with O ^n 8 / 5 • log 1 / 5 n^j edges. 

In order to state our stability results we introduce error functions that slowly decay as 
the process evolves. The polynomial p(t) has degree 5 and positive coefficients. We do not 
explicitly define this polynomial; it suffices that its coefficients are sufficiently large. Define 

/, = { e jS f f = e^-^-f» 5 for/ = 0,1,2,3,4 

hf = ( *(t)-16(2-fyfi fo r / = o,l, 2. 
Define B{ to be the event that there exists j < i such that 

\Q(j)-q(t(j))n 2 \>fMj)h 29/15 
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or there is a set A G (^) and / G {0, 1, . . . , 4} such that (2) % Ej and 



.2/ 



,2- 



. 2 /_ 1 



|Xa,/0")I-s/(*0")X~ t >ff(t(j))n 

or there is a set A G ( [ 3 ] ) and / G {0, 1, 2} such that (£) 2 #j and 

> y/(t(i)K~^ + /i / (t(i))n 1 -¥-A 

or there is a set A G ('3') such that (^) % Ej and 

1^,3(7)1 > 15- 

We introduce absolute constants p, p and 7. As in our analysis of the iC^-free process, p 
and p are small relative to p(t) and 7 is large with respect to p. Define m = pn 8//5 log 1 / 5 n. 

Theorem 13. If n is sufficiently large then 

Theorem 14. If n is sufficiently large then 

Pr (a (G Mn8/5logl/5n ) > 7 ra 2 / 5 log 4 / 5 n \ B^) < e~^\ 

The methods introduced in Sections 3 and 4 can be used to prove Theorems 13 and 14. 
This is more or less straightforward and is mostly left to the reader; we conclude this section 
with the details that do not follow immediately as above. 

Proof of Theorem 13. There is one significant difference between the triangle-free process 
and the .fQ-free process that must be dealt with here. In the case of the triangle-free 
process, there is a one-to-one correspondence between edges closed when ei is added and 
vertices that are partial with respect to ej in Gi-\. The analogous correspondence does 
not hold for the i^-free process: Since a pair {u, v} that intersects e, could be a subset of 
B U ej for many sets B G X ei ±(i — 1), there is not a one-to-one correspondence between 
pairs {u, v} closed by the addition of to the graph and X ei ^(i — 1). In order to overcome 
this problem we note that, based on simple density considerations, the difference between 
these two quantities is bounded by n 4 / 15 in the event B m . 

Let e be a sufficiently small constant (This constant is chosen so that \Oi\ > n 2_e 
for all i < m in the event B m ). Set k = n 1 / 5 + 8e and let Aii be the event that there 
exists {u, v} G (^') > distinct vertices w\, . . . , wu £ [n] \ {u, v} and distinct Z\, Z2, ■ ■ ■ , Z2k £ 
[n] \ {u, v, z\, z 2 , ■ ■ ■ , z 2 k} such that 

{u,Wj},{u,Z2j-l},{u,Z2j},{v,Z 2 j-l},{v,Z2j},{Wj,Z 2 j-l},{Wj,Z2j} G Ei for j = 1, ...,k. 
Claim 15. // n is sufficiently large 

Pr (M m AE^ <e-" 1/5 . 



IS 



Proof. 

Pr (M m AB^) < 



Tl\ Oh 71- ( 1 

k, 



n 2 - ' \-n 2k -m 7k 



en 



3 • (/in 8 / 5 log 1/5 n) 7 




7k 



k 



□ 

Now let I = n 6e and let Mi be the event that there exist vertices u, v, z and disjoint sets 
A, B e i} n }) such that A C Ni(u) n Ni(z) , B C 2Vj(it) n iVi(u) and G; has a matching of I 
edges in A x B. 

Claim 16. If n is sufficiently large then 

Pr(M m AB^) < e~ n5e . 

Proof. 

Pr (Mm) < n 3 ■ r) ■ n e ■ m 5e ' 



< rT 



en 



■ (pn 8 / 5 log 4 / 5 nf 



I ■ n 10 " 5e 



□ 



Now suppose uJj-i % V Mj-\ VMj-i- Let W be the set of vertices w such that {v, w} 
is closed by the addition of the edge ej = {u, v} and there exist distinct vertices z w , z' w such 
that {w, z w }, {w, z' w } S X ejt i(j — 1). Note that ujj-\ % Bj-i implies that the number of 
pairs B G X G .^(j — 1) that correspond to a particular vertex w € W is at most 16. Thus the 
difference between \X e .^{j — 1)| and the number of pairs closed by the addition of edges ej 
is at most 32|W|. It remains to argue that W is small. First note that ooj-i % M—i implies 
that each vertex z is in the set {z w ,z' w } for at most 16n 6e vertices w £ W. Let W C W 
be a maximum set such that a,b £ W implies {z a ,z' a } n {zb,z' b } = 0. By the previous 
observation (using 0Jj-\ % A/}-i ) we have \ W'\ > |P^|/(16n 6e ). Furthermore, Uj-i % M-j-i 
implies that \W'\ < n 1 ^ +8e . Thus, the number of pairs closed by the addition of ej is in 
the interval 

\X ejA (j - 1)| - 32 • lGnV 5+14e , \X ejA (j - 1) 

which is sufficient for the proof. □ 

Proof of Theorem 14- As in the proof of Theorem 5, we fix a set K of 7n 2 / 5 log 4 / 5 n vertices 
and show that the probability that K remains independent is small even when compared 
with the number of such sets. We condition on B m and a bound of n 1 / 5+3e on all co-degrees. 
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(This bound on the co-degrees follows from a very simple first moment calculation. We could 
establish a tighter bound using martingale inequalities, but that is not necessary for this 
argument.) 

There are two significant differences between the triangle-free process and the K^-fvee 
process here: the fact that in the latter the addition of an edge ej that is disjoint from K 
could close many pairs within K and the fact that the neighborhood of a single vertex could 
include K as a subset. 

We track the number of open pairs within two kinds of subgraphs. Set 

k = In 2 / 5 1 og 4 / 5 n. 
3 6 

Let A, B G (^) . We say that a pair {it, v} G A x B is closed with respect toixB at 
step j if there exists a step i < j such that {u, v} is among the edges closed by ej = {x, y] 
and either 

(i) ej n (A U B) = {y} = {it} and there exists z ^ A U B such that {v, z} G Xi x y \ ^ and 
|JVi_i(z) n Ni-i(x) n(AU B)\ < n 1 / 5 ^ 36 or 

(ii) a n (A U B) = and ( |JVi_i(ei) n A\ < n 1 / 5 ^" 36 or |iV i _i(e i ) n B| < n 1 / 5 ^" 36 ). 

If the pair {it, v} £ Ax B is neither closed with respect to {it, v} nor in the edge set Ej then 
it is open with respect to A x B . Note that, since we assume co-degrees are bounded 
by re 1 / 5+3e , the change in the number of pairs closed with respect to A x B that results from 
the addition of an edge ej is at most n 2 / 5_p '. It follows from the techniques in Section 4 that 
with high probability we have the following: For all steps j < m and all pairs A, B S (^) 
such that (A x B) D £7j = the number of edges in A x B that are open with respect to 
A x 5 is at least ife" 16 ^) 5 . 

We also track the number of open pairs within sets D consisting of k vertices. We say 
that a pair {u, v} G (?) is closed with respect to D at step j if there exists a step i < j 
such that {n, is among the edges closed by ej = {x, y} and either 

(i) a n D = {y} = {u} and there exists z D such that |iVj_i(z) n N^i(x) D D\ < 

n l/5-p-3e and [ v ^z] £ ^ 

(ii) ei n D = and |JVi_i(z) n A^-i(y) n D\ < n 1 / 5 -^/ 2 . 

If the pair {it, u} G (^) is neither closed with respect to {it, v} nor in the edge set Ej then 
it is open with respect to D . Again following the techniques in Section 4, we see that 
with high probability we have the following: For all steps j < m and all sets D G (v) such 

that (®) n Ej = the number of edges in (^) that are open with respect to D is at least 

e -i6t(i) 5 

4 e 

It remains to show that every set -fT of 7n 2//5 log 4 / 5 n vertices contains: 

(a) Disjoint sets A, B of vertices such the difference between the number of pairs in 
Ax B that are open and the number that are open with respect to A x B is less than, 
say, re 23 / 30 , or 
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(b) A set D of k vertices such the difference between the number of pairs within D that 
are open and the number that are open with respect to D is less than n 23 / 30 . 

A main tool here is the following observation which follows from a simple first moment 
calculation. Let Mi be the event that there exist integers r, s such that s > n 2e , r-s > n 2 ^ +e 
and disjoint sets X G ( [ ™ ] ) and Y G ( [ ™ ] ) such that 

\N m (y) n X\ > s for all y G Y. 

2 /5 

Claim 17. If n is sufficiently large then Pr (Mm) < e~ n 
Now, let Lj be the set of vertices x G" K such that 

Let Lj = {xi,X2,-..} be arranged in decreasing order of \Nj(xg) n K|. A simple case 
analysis in conjuction with Claim 17 now establishes the desired property. 

Case 1. \Nj(xi) n K\ > k. 

Consider D C Nj(xi) K such that |D| = k. Note that, appealing to the bound on 
common neighbors of triples of vertices given by conditioning on B m , all pairs within D 
that are closed but not closed with respect to D are contained in Nj(x\) n Nj(y) n D 
where y G Lj and {x\,y} G The number of such vertices y is at most ra 1 / 5 +p+ 4e by 
Claim 17. Each such neighborhood includes less than n 2 / 5+6e edges because of the bound 
on the co-degrees. Therefore, the number of spoiled pairs within D is at most n 3 / 5 +P+ 10e . 

Case 2. \Nj(xi) n K\ < k. 

Choose 

A Q [j Nj(x e ) n K B Q K \ l\J Nj(x e ) 
e=i \i=i 

such that | A \ = \B\ = k where £' is the smallest index such that the cardinality of this union 
is at least k. 

First suppose £' < ra 2 / 15 . Note that no pairs m Ax B are spoiled in this case: If x, y G" K 
then either N j (x)nN j (y) C A or |iVj(x)niVj(y)n^| < 16n 2 / 15 ( using the bound on common 
neighbors of triples of vertices). 

Finally, suppose £' > n 2 / 15 . Note that, by Claim 17, we have \Nj(xe) n K\ < n 4 / 15+e 
and \Lj\ < ra 1 / 5 +P+ 4(E . Thus, the number of spoiled pairs is at most ra 8 / 15 + 2(E ri 1 / 5 +P+ 4 < E = 

□ 

6 Martingale Inequalities 

Lemmas 6 and 7 follow from the original martingale inequality of Hoeffding. 

Theorem 18 (Hoeffding [14]). Let = Xq,X\, ... be a sequence of random variables such 
that 

Xk-i — Mfe < A fe < X k _i + 1 - fi k 
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for some constant < /i& < 1 for k =1, . . . , m. Set /j, = 2~^fcLi A*fc anc ^ // = 1 — /i. // 
Xq, Xi, ... is a supermartingale and < t < /2 i/ien 



(X m > m*) < 





A* 






/i— t 


m 




_n + t_ 




Jl — t 







(8) 



Hoeffding's result was for martingales, but the extension to supermartingales is straightfor- 
ward. For a survey of applications of this and similar results, see McDiarmid [19]. 

In order to apply Theorem 18 to the martingales considered in this paper, we introduce 
the following function. For < v < 1/2 set v = 1 — v and define 



g(x) = g(x, v) = (v + xv) log 



+ (v — xv) log 



v — XV 



for 



V + XV 

Note that, under the conditions of Theorem 18, we have 

Pr(X m > mxfj.) < e 9 ^ m and Pr(X m > mxp) < e ^~ x ^ m . 
Note further 

"i \ v 
9 (.x) 



1 < x < 1. 



(1 + x)(v — xv) 



Proof of Lemma 6. Let = Aq,A\,... be a (77, X)-bounded submartingale with N > 2ry. 
Let a < 77777. Define X, = —Ai/{rj + N). Note that Theorem 18 applies to Xq, Xi, . . . with 
H = N/(rj + N). Thus 



Pr(A m < -a) = Pr[X m > 



7] + N 



<exp<g 



run 



,/x m 



It remains to bound g(x). Note that if — 1 < x < then g"(x) < —v. As g(0) = g'(0) = 0, 
it follows that g(x) < —vx 2 /2 for —1 < x < 0. Therefore, 



Pr(A m < —a) < exp 



T] am 
N + rj 2m 2 rj 2 



< exp 



2mn{N + rj) 



□ 



Proof of Lemma 7. Let = j4o> -^lj ■ ■ ■ be a (77, iV)-bounded supermartingale with N > 10r?. 
Let o < rjm. Define X, = Ai/(jj+N). Theorem 18 applies to Xq,X\, . . ., with (j, = 77/ (rj+N). 
We have 

/a. 



iV(4n > a) = Pr X m > 



< exp <^ 



\ 777,77' 



,/i m 



77 + iV / 

It remains to bound fif(x). Note that for x > we have </'(cc) < —v/(l + x). Since 
5(0) = = 0) this implies 

5(0;) < — u [(1 + x) log(l + x) — x] < V 



X 



2 + 6 12 ^ 



< 



11 



30 



-vx 



Thus 



Pr(A m > a) < exp 



11 



30 mr)(N + 77) 



< 



exp 



3mr/N J 



□ 
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